Thiès Region
An Senegalese Legal Texts Structuration Using LLM-augmented Knowledge Graph
Kane, Oumar, Allaya, Mouhamad M., Samb, Dame, Bousso, Mamadou
Abstract--This study examines the application of artificial intelligence (AI) and large language models (LLM) to improve access to legal texts in Senegal's judicial system. The emphasis is on the difficulties of extracting and organizing legal documents, highlighting the need for better access to judicial information. The research successfully extracted 7,967 articles from various legal documents, particularly focusing on the Land and Public Domain Code. A detailed graph database was developed, which contains 2,872 nodes and 10,774 relationships, aiding in the visualization of interconnections within legal texts. In addition, advanced triple extraction techniques were utilized for knowledge, demonstrating the effectiveness of models such as GPT - 4o, GPT -4, and Mistral-Large in identifying relationships and relevant metadata. Through these technologies, the aim is to create a solid framework that allows Senegalese citizens and legal professionals to more effectively understand their rights and responsibilities. Artificial intelligence (AI) is a transformative technology that raises significant ethical considerations regarding its use. Initiatives like Microsoft's "AI for Humanitarian Action" and Google's "AI for Social Good" focus on enhancing jurisprudence and human rights [1]. Moreover, the Center for Social Good Data Science at the University of Chicago applies AI to improve criminal justice systems.
- North America > United States > Illinois > Cook County > Chicago (0.24)
- Europe > France (0.14)
- Africa > Senegal > Thiès Region > Thiès (0.04)
- Africa > Senegal > Dakar Region > Dakar (0.04)
Voice of a Continent: Mapping Africa's Speech Technology Frontier
Elmadany, AbdelRahim, Kwon, Sang Yun, Toyin, Hawau Olamide, Inciarte, Alcides Alcoba, Aldarmaki, Hanan, Abdul-Mageed, Muhammad
Africa's rich linguistic diversity remains significantly underrepresented in speech technologies, creating barriers to digital inclusion. To alleviate this challenge, we systematically map the continent's speech space of datasets and technologies, leading to a new comprehensive benchmark SimbaBench for downstream African speech tasks. Using SimbaBench, we introduce the Simba family of models, achieving state-of-the-art performance across multiple African languages and speech tasks. Our benchmark analysis reveals critical patterns in resource availability, while our model evaluation demonstrates how dataset quality, domain diversity, and language family relationships influence performance across languages. Our work highlights the need for expanded speech technology resources that better reflect Africa's linguistic diversity and provides a solid foundation for future research and development efforts toward more inclusive speech technologies.
- North America > United States (0.05)
- Africa > Zambia (0.04)
- Oceania > Tonga (0.04)
- (22 more...)
- Overview (1.00)
- Research Report > New Finding (0.46)
- Leisure & Entertainment (1.00)
- Media > Radio (0.67)
Kallaama: A Transcribed Speech Dataset about Agriculture in the Three Most Widely Spoken Languages in Senegal
Gauthier, Elodie, Ndiaye, Aminata, Guissé, Abdoulaye
This work is part of the Kallaama project, whose objective is to produce and disseminate national languages corpora for speech technologies developments, in the field of agriculture. Except for Wolof, which benefits from some language data for natural language processing, national languages of Senegal are largely ignored by language technology providers. However, such technologies are keys to the protection, promotion and teaching of these languages. Kallaama focuses on the 3 main spoken languages by Senegalese people: Wolof, Pulaar and Sereer. These languages are widely spoken by the population, with around 10 million of native Senegalese speakers, not to mention those outside the country. However, they remain under-resourced in terms of machine-readable data that can be used for automatic processing and language technologies, all the more so in the agricultural sector. We release a transcribed speech dataset containing 125 hours of recordings, about agriculture, in each of the above-mentioned languages. These resources are specifically designed for Automatic Speech Recognition purpose, including traditional approaches. To build such technologies, we provide textual corpora in Wolof and Pulaar, and a pronunciation lexicon containing 49,132 entries from the Wolof dataset.
- Africa > Senegal > Dakar Region > Dakar (0.05)
- Africa > Niger (0.04)
- Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
- (15 more...)
- Banking & Finance (0.93)
- Food & Agriculture > Agriculture (0.92)
- Media (0.69)
Humans Beat Deep Networks at Recognizing Objects in Unusual Poses, Given Enough Time
Ollikka, Netta, Abbas, Amro, Perin, Andrea, Kilpeläinen, Markku, Deny, Stéphane
Deep learning is closing the gap with humans on several object recognition benchmarks. Here we investigate this gap in the context of challenging images where objects are seen from unusual viewpoints. We find that humans excel at recognizing objects in unusual poses, in contrast with state-of-the-art pretrained networks (EfficientNet, SWAG, ViT, SWIN, BEiT, ConvNext) which are systematically brittle in this condition. Remarkably, as we limit image exposure time, human performance degrades to the level of deep networks, suggesting that additional mental processes (requiring additional time) take place when humans identify objects in unusual poses. Finally, our analysis of error patterns of humans vs. networks reveals that even time-limited humans are dissimilar to feed-forward deep networks. We conclude that more work is needed to bring computer vision systems to the level of robustness of the human visual system. Understanding the nature of the mental processes taking place during extra viewing time may be key to attain such robustness.
- Europe > Finland > Uusimaa > Helsinki (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > Canada > Newfoundland and Labrador > Labrador (0.04)
- Africa > Senegal > Thiès Region > Thiès (0.04)
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- Leisure & Entertainment > Sports (0.68)
Analysis of COVID-19 evolution in Senegal: impact of health care capacity
Fall, Mouhamed M., Ndiaye, Babacar M., Seydi, Ousmane, Seck, Diaraf
We consider a compartmental model from which we incorporate a time-dependent health care capacity having a logistic growth. This allows us to take into account the Senegalese authorities response in anticipating the growing number of infected cases. We highlight the importance of anticipation and timing to avoid overwhelming that could impact considerably the treatment of patients and the well-being of health care workers. A condition, depending on the health care capacity and the flux of new hospitalized individuals, to avoid possible overwhelming is provided. We also use machine learning approach to project forward the cumulative number of cases from March 02, 2020, until 1st December, 2020.
- North America > United States (0.14)
- Africa > Senegal > Dakar Region > Dakar (0.05)
- Africa > Senegal > Diourbel Region > Diourbel (0.04)
- (4 more...)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
- Health & Medicine > Therapeutic Area > Immunology (1.00)
- Health & Medicine > Epidemiology (1.00)